Picture for Anh Tuan Luu

Anh Tuan Luu

Don't Read Everything: A Curvature-Conditioned Query for Linear Attention

Add code
May 31, 2026
Viaarxiv icon

WINDQuant: Weight-Informed Neural Decision-Making for Global Mixed-Precision LLM Quantization

Add code
May 26, 2026
Viaarxiv icon

When In-Distribution Gains Fail: Evaluating Weak-to-Strong Reward Models under Preference Shift

Add code
May 26, 2026
Viaarxiv icon

Understanding and Preventing Entropy Collapse in RLVR with On-Policy Entropy Flow Optimization

Add code
May 12, 2026
Viaarxiv icon

OpenMobile: Building Open Mobile Agents with Task and Trajectory Synthesis

Add code
Apr 16, 2026
Viaarxiv icon

Towards Reliable Truth-Aligned Uncertainty Estimation in Large Language Models

Add code
Apr 01, 2026
Viaarxiv icon

OrchMAS: Orchestrated Reasoning with Multi Collaborative Heterogeneous Scientific Expert Structured Agents

Add code
Mar 03, 2026
Viaarxiv icon

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Add code
Jan 15, 2026
Viaarxiv icon

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Add code
Jan 13, 2026
Viaarxiv icon

MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval

Add code
Oct 10, 2025
Viaarxiv icon